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1. Introduction 

In a preceding article [I] coherence in a relative sense, i. e., understood as a relation 
between a given observable and a given quantum state, was postulated to be identical 
with incompatibility between observable and state as far as its quantity Ic is concerned. 
(For notation see the passage immediately following the proof of Proposition 5 below.) 
Then it was shown that bipartite pure state entanglement is expressible as Ic (with a 
suitable observable). 

Pure states cannot be obtained as mixtures. Therefore, the question if Ic is concave, 
i. e., a genuine entropy quantity, or convex, i. e., a genuine information one, or 
something third, could not be put in this context. The first aim of this study is to 
clarify this point. (This is done in Proposition 5.) To enable this, the mixing property 
of relative entropy (paralleling the mixing property of entropy and Donald's identity for 
relative entropy, see the Remark) is derived. 

In a follow up of the mentioned article [2j the special case of the final bipartite 
pure state | ^)i 2 in repeatable measurement, when the initial state is pure, was 
studied. It was shown that the initial quantity of incompatibility between the measured 
observable and the initial state reappears as the amount of entanglement in \ ip)n, and is 
further preserved when it is shifted in reading the measurement result. This completes 
Vedral's result [3] that the information transfer from object (subsystem 1) to measuring 
apparatus (subsystem 2) does not exhaust the mutual information J 12 in the final state. 

I think it is of interest to find out if the mentioned preservation of the quantity of 
incompatibility between the measured observable and the initial pure state is restricted 
to pure state, or it can be generalized to mixed initial state. This is not a straightforward 
generalization. It requires more knowledge on Ic- The second aim of this study is to 
provide such knowledge, which will be possible due to the mentioned auxiliary relative- 
entropy relations (see section 3). 

In a further preceding article 4J an arbitrary discrete incomplete observable A and 
its completion A c to a complete observable were investigated and it was shown that 
Ic{A, p) < Ic(A c ,p) for any state p. This inequality is expected if the assumption on 
the identity of the amount of coherence and that of incompatibility is correct. But it is 
desirable to evaluate Ic(A c ,p) — Ic(A,p) and thus to try to acquire more insight into 
the nature of Ic- This is the third aim of this article. (See the discussion after the proof 
of the theorem below. ) 

The fourth aim of this paper is to present an argument that starts with the 
mentioned identity assumption and leads to an expression for the quantity of coherence 
in a natural way Will this expression be the same as the ad hoc introduced one? This 
is done in section 2 and an affirmative answer is obtained. It is summed up in the 
conclusion (subsection 5.2.). 

The fifth and last aim of this investigation is perhaps the most important one. 
Namely, in [I] it was established that Ic plays an important role also in some mixed 
bipartite states. This line of research should be continued in a follow up because it may 
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contribute to our understanding how mutual information in general bipartite states 
breaks up into a quasi-classical part and entanglement, which is the object of study of 
a wide circle of researchers, e. g. (5j, [E] To this purpose, one may need more detailed 
knowledge of the properties of Iq. To acquire such knowledge is the fifth aim of this 
article (see section 4). 

1.1. Background in classical statistical physics 

To obtain a background for our quantum study of coherence, we assume that a classical 
discrete variable A(q) = J2i a iXi(l) is given (all a ; 6 R being distinct). The symbol 
q denotes the continuous state variables (as a rule, it consists of twice as many variables 
as there are degrees of freedom in the system); \i are the characteristic functions 
V7 : Xi{o) = 1 if ? G , and zero otherwise. Naturally, A\ are (Lebesgue 
measurable) sets such that A(q) = a\ if and only if q 6 Ai, and — Q, where 

Q is the entire state space (or phase space) and the sum is the union of disjoint sets. 

Let p(q) be a continuous probability distribution in Q with the physical meaning 
of a statistical 'state' of the system. One can think of p(q) as of a mixture 



where V/ : Pi = fa p{q)xi(Q)dq are the statistical weights (probabilities of the 
results ai if A(q) is measured in p(q)), and V/, pi > : Pi(q) = p(q)xi(q) /Pl are 
the 'states' with defmite (or sharp) vàlues of A(q). 

Let B(q) be any other continuous or discrete variable. Then, utilizing (1), its 
average can be written 



One distinguíshes the contributions of the individual eigenvalues a>i of A(q) through the 
terms on the RHS. They contribute to (B) p each separately. 

All this serves only as a classical background to help us to understand the non- 
classical, i. e., purely quantum relations between the analogous quantum entities. 

1.2. Transition to the quantum mechanical case 

The quantum mechanical anàlogues of the mentioned classical entities are the following. 

Discrete observables (Hermitian operators) A = Y^i &1P1 (spectral form in terms of 
distinct eigenvalues), p quantum state (density operator), and B an arbitrary observable 
(Hermitian operator). The quantum average is {B) p = tr(pB). 

In the transition from classical to quantum one runs into a surprise, that is known 
but, perhaps, not sumciently well known. Before we formulate it in the form of a lemma, 
let us introduce the Lüders state pi in order to obtain the quantum anàlogues of 



(1) 
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relations (1) and (2). It is that mixture of states, each with a definite value of A, which 
has a minimal Hilbert-Schmidt distance from the given state p jH]. It is defined as 

i 

where 

V/ : pi = tr(pPi) (36) 

are again the statistical weights in (3a) (or the probabilities of the results ai when A is 
measured in p), and 

VZ, p,>0: p l L = P lP P l lp l (3c) 
are the states with definite vàlues a\ of A Finally, 

i 

Decomposition (3a) is the analogue of (1), and (3d) is that of (2). 
Lemma 1. The following four statements are equivalent: 

(i) The state p cannot be wrítten as a mixture of states in each of which the 
observable A has a definite value. 

(ii) The observable A and the state p are incompatible, i. e., the operators do not 
commute [A, p] ^ 0. 

(iii) The Lüders state pl given by (Sa)-(Sc) is distinct from the original state p. 

(iv) There exists an observable B such that 

{B) P Ï{B) PL , (4) 

where the RHS is given by (3d). 
Proof is given in Appendix 1. 

The physical meaning of lemma 1 is that it defines a kind of quantum coherence 
as a special relation between observable and state. Experiment ally it is exhibited in 
interference. In this relative sense (relation between variable and state) it is lacking 
in classical physics because there a state can always be written as a mixture of states 
in each of which the variable in question has a definite value (negation of (i), cf (1)). 
Though classical waves do exhibit a kind of coherence and show interference, but this 
is in a different sense (cf section 5). 

One should note that the Lüders state needs no other characterization than its role 
in lemma 1 (in particular (iii)). The fact that it is "closest" to p in Hilbert-Schmidt 
mètrics, though actually not important for this study, raises the thought-provoking 
qüestions if "closest" is true also in other mètrics; if not, why is the Hilbert-Schmidt 
mètrics more suitable. 

We take two-slit interference [H] to serve as an illustration for lemma 1. 

Let A be a dichotomic position observable with two eigenvalues: localization at the 
left slit, and localization at the right slit on the first screen. Let p be a wave packet 
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that has just arrived at this two-slit screen. Next, one has to find a suitable observable 
B such that inequality (4) be satisfied at the mentioned moment. Moreover, one wants 
to observe experimentally the LHS of (4), or rather the individual probabilities of the 
eigenvalues of B (that go into the LHS). 

To this purpose, one actually replaces B by another localization observable A' on 
a second screen, to which the photon will arrive some time later. This observable 
is suitable for observation (of its localization probabilities). Hence, one can define 
B = U^A'U, U being the evolution operator expressing the movement of the particle 
from the two-slit screen to the second one. One should note that B is not a position 
observable though A' is because the hamiltonian that generates U contains the kinetic 
energy (square of linear momentum). 

Claim (i) of lemma 1 says that the particle is not moving through either the left or 
the right slit. Claim (ii) expresses the same fact algebraicly. Namely, p, being a pure 
state | ip){ip |, would commute with A only if | v/j) lay in an eigensubspace of A. In our 
case this would mean that the particle traverses one of the slits. 

The Lüders state pi is, in some sense, the best approximation to p of a state 
traversing one or the other of the slits. Naturally, p ^ Pl as claimed by (iii). Claim 
(iv), i. e., relation (4), amounts to the same as the fact that the interference pattern on 
the second screen is not equal to the sum of those that would be obtained when only one 
of the slits were open (for some time) and then the other (for another, disjoint, equally 
long time). 

In the two-slit experiment one actually observes the time-delayed equivalent of (4): 

(5) 

Since the LHS of (5) is distinct from the RHS, one speaks of the former as interference. 
In the described two-slit case the LHS of (5) gives fringes, whereas the RHS does not. 
Nevertheless, it is not always true that the LHS of (5) itself means interference. This 
is the case only with a suitable pair of A and p (cf (ii) in lemma 1). Let me give a 
counterexample. 

Let us take another two-slit experiment in which the slits have polarizers that 
give opposite linear polarization to the light passing the slits ^Oj- The state p in 
the slits is then such that we have equality in (5) (though A' is the same), and there 
is no interference because [A, p] = 0. (The state p —\ ip)(ip | is now in the composite 
spatial-polarization state space, and the spatial subsystem state - the reduced statistical 
operator - is a Lüders state.) 

One should note that when interference is displayed, one has three ingredients: the 
state p, the observable A the two eigenvalues of which play a cooperatíve role, and the 
second observable A' the probabilities of eigenvalues of which are observed. Since in 
theory there can be many observables like A', or B in (4), one likes to omit them. Then 
one speaks of coherence of the observable A in the state p. We make use of the same 
concepts in the general theory. 
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Definition 1. The LHS of relation (4), in case inequality (4) is vàlid, is called 
interference. If an observable A and a state p stand in such a mutual relation that any 
of the four claims of lemma 1 is known to be vàlid, then one speaks of coherence. 

One should note that the concepts of interference and of coherence stand in a 
peculiar relation to each other: There is no coherence (between A and p) unless an 
observable B that exhibits interference can be, in principle, found; if the latter is the 
case, and only then, one may forget about B, and concentrate on the relation between 
A and p, i. e., on coherence. The kind of quantum coherence investigated in this paper 
can be more fully called "eigenvalue coherence of an observable in relation to a state" 
in view of the cooperative role of some eigenvalues (or, more precisely, their quantum 
numbers, because the vàlues of the eigenvalues play no role) as seen in (4). 

Thus, any of the four (equivalent) claims in lemma 1 defines coherence. But for the 
investigation in this article the important claim is (ii): coherence exists if and only if A 
and p do not commute. This remark is the còrner stone of the expounded approach to 
investigat ing coherence (as in the preceding studies |T], |lj). 

2. How to obtain a quantum measure of coherence? 

We start with the assumption that coherence of an observable A with respect to a state 
p is essentially the same thing as incompatibility of A and p: [A, p] ^ 0. The quantum 
measure will be called coherence or incompatibility information, and it will be denoted 
by I c (A,p) or shortly I c (cf (10) below). 

One wonders what the meaning of a larger value of Ic for coherence is. It is more 
of what? The only answer I can think of is in accordance with the above assumption: 
More of incompatibility of A and p. 

The next question is: Do we know what is a " larger amount of incompatibility" ? 

The seminal review on entropy of Wehrl [Q] (section III. C there) explains that each 
member of the Wigner-Yanase-Dyson family of skew informations 

I p (p, A) = -S p (p, A) = (l/2)tr([/, A] [p 1 ^, A]), < p < 1, (6) 

is a good measure of incompatibility of p and A. Namely, I p (p, A) is positive unless p 
and A commute, when it is zero. It is also convex as an information quantity should be. 
Substituting the spectral form of A in (6), one obtains 

I p = (l/2)tr(£ £ ai \f, PMp 1 -?, Pi,)), 
i v 

One can see that I p depends on the eigenvalues of A. 

As well known, A and p are compatible if and only if all eigenprojectors Pi of the 
former are compatible with the latter. The eigenvalues of A do not enter this relation. 
Hence, I p (p, A) given by (6) is not the kind of incompatibility measure that we are 
looking for. One wonders if there is any other kind. 
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To obtain an answer, we turn to a neighboring quantity: the quantum amount of 
uncertainty of A in p. It is the entropy S(A, p): 

S{A,p)=H{ Pl ), (7a) 

where H(pi) is the Shannon entropy 

H(pi) = -^Pilogpi, (76) 
i 

and 

V/: Pl = ti(Pip). (7c) 

It is known that whenever A and p are incompatible, and A is a complete observable, 
i. e., if all its eigenvalues are nondegenerate (we'll write it as A c ), then always 
S(A c ,p) > S(p). When A c is compatible with p, the two quantities are equal. The 
interpretation that the larger the difference S(A c ,p) — S(p), the more incompatible A c 
and p are seems plausible. Hence, we require for complete observables A c , that Ic{A c , p) 
should equal this quantity: Ic(A c ,p) = S(A C , p) — S(p). Equivalently, one can require 
that the following peculiar decomposition of the entropy in case of a complete observable 
should hold: 

S(p) = S(A c ,p)-I c (A c ,p). (8) 

On the other hand, if A is a discrete observable that is complete or incomplete but 
compatible with p, then the following decomposition parallels (8): 

S{p) = S(A,p) + y £piS(PipPi/pi) (9) 
i 

(cf (7a), (7b) and (7c)). If pi = 0, the corresponding term in the sum is by definition 
zero. 

Decomposition (9) is obtained by application of the mixing property of entropy [11 
(see Sections II. F. and II. B. there). It applies to orthogonal state decomposition, in this 
case to p = T,iPi{PipPi/Pi), and it reads S(p) = H( Pl ) + J2iPiS(PipPi/pi) (cf 
(7b)). 

The coherence information Ic does not appear in (9). This is as it should be because 
it is zero due to the assumed compatibility of A and p. 

In case of a general discrete A, which is complete or incomplete, compatible with 
p or not, we must interpolate between (8) and (9). This can be done by observing that 
both decompositions can be rewritten in a unified way as 

I c (A,p)=S(J2PipPi)-S(p) (10) 
l 

(vàlid for either A = A c or for [A, p] = 0). The searched for interpolated formula should 
thus be the same relation (10), but vàlid this time for all discrete A. Thus, Iq(A, p) is 
obtained by the presented argument. 
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Making use of the mixing property of entropy, we can rewrite (10) equivalently as 
the following general decomposition of entropy: 

S(p) = S(A,p) +J2PiS(PipPi/Pi) - Ic(A,p). (11) 
i 

(Note that A is any discrete observable in (11).) 

In order to derive a number of properties of coherence information, we make a 
deviation into relative entropy theory. 

3. Useful relative-entropy relations 

The relative entropy S(p\\a) of a state (density operator) p with respect to a state a is 
by definition 

S(p\\a) = tv[plog(p)} - tv[plog(a)] (12a) 

if supp(p) Ç supp(a); (126) 

or else S(p\\a) = +oo (see p. 16 in [12]). By 'support', denoted by 'supp', is meant 
the subspace that is the topological closure of the range. 

If a is singular and condition (12b) is vàlid, then the ort ho complement of the 
support (i. e., the null space) of p, contains the null space of o~, and both operators 
reduce in supp(cr). Relation (12b) is vàlid in this subspace. Both density operators 
reduce also in the null space of a. Here the log is not defined, but it comes after zero, 
and it is generally understood that zero times an undefmed quantity is zero. We'll refer 
to this as the zero convention. 

The more familiar concept of (von Neumann) quantum entropy, S(p) = 
— tr[plog(p)\, also requires the zero convention. If the state space is infinite dimensional, 
then, in a sense, entropy is almost always infinite (cf p.241 in JI]). In finite-dimensional 
spaces, entropy is always finite. 

There is an equality for entropy that is much used, and we have utilized it, the 
mixing property concerning orthogonal state decomposition (cf p. 242 in [TT]): 

a = ^2w k o- k , (13) 

k 

VA; : w k > 0; for w k > 0, a k > 0, tra k = 1; VA: ^ k' : o~ k o k i = 0; J2k w k = 1- Then 
S(a) = H(w k )+J2k w k S(a k ), H(w k ) = - T,k[wklog(w k )] being the Shannon entropy 
of the probability distribution {w k : VA:}. 

The first aim of this section is to derive an analogue of the mixing property of 
entropy. The second aim is to derive two corollaries that we shall need in this paper. 

We will find it convenient to make use of an extension log e of the logarithmic 
function to the entire real axis: if < x : log e (x) = log(x) , if x < : 
log e (x) = . 

The following elementary property of the extended logarithm will be utilized. 
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Lemma 2. If an orthogonal state decomposition (13) is given, then 

i i 
log e (a) = Y} l °9{wk)]Qk + E log e ((T k ), (14) 
k k 

where Qk is the projector onto the support of o~k, and the prim on the sum means that 
the terms corresponding to Wk = are omitted. 

Proof. Spectral forms VA;, Wk > : — J2i k s i k I h)(h I ( an s i k positive) give 
a spectral form a = J2k J2i k w kSi k \ h)(h \ of a on account of the orthogonality assumed 
in (13) and the zero convention. Since numerical functions define the corresponding 
operator functions via spectral forms, one obtains further 

log e (a) =J2J2l l °9 e ( w kSi k )] \lk){lk\=Y.Y\ lo 9( w k) + log{si k )} \l k ){h\ = 

k Ifc k Ijç 

k k l jç 

(In the last step Qk = J2i k I h){h \ for Wk > was made use of.) The same is obtained 
from the RHS when the spectral forms of o~k are substituted in it. □ 
Proposition 1. Let condition (12b) be vàlid for the states p and a, and let an 
orthogonal state decomposition (13) be given. Then one has 

S(p\\a) = S^YjQkpQk) ~ S(p) + H(p k \\w k ) + J2Pk S (QkpQk/Pk\Wk), (15) 

k k 

where, for Wk > 0, Qk projects onto the support of a k , and Q k = if Wk = 0, 
p k = tr(pQk), and 

H(pk\\w k ) = Y^\Pklog{pk)] - Y^\Pklog{w k )} (16) 

k k 

is the classical discrete counterpart of the quantum relative entropy, vàlid because 
(p k >0) ^(w k >0). 

One should note that the claimed validity of the classical analogue of (12b) is 
due to the defmitions of pk and Qk- Besides, (13) implies that (J2kQk) projects onto 
supp(a). Further, as a consequence of (12b), {J2kQk)p = P- Hence, tr( J2k QkPQk) = 
tr(£ fc QfcP) = l- 

We call decomposition (15) the mixing property of relative entropy. 
Proof of proposition 1: We define 

VA;, p k > : p k = QkpQk/Pk- (17) 
First we prové that (12b) implies 



Vfc, Pk > : supp(p fc ) Q supp(o-fc). 



(18) 
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Let k, p k > 0, be an arbitrary fixed value. We take a pure-state decomposition 

p = ^\ n \4>n)(lpn\ (19 a), 

n 

Vri : À n > 0. Applying Qk - Qk to (19a), one obtains another pure-state decomposition 

QkpQk = PkPk = ^Qk | Ipn) (?Pn \ Qk (196) 
n 

(cf (17)). Let Q k be a nonzero vector appearing in (19b). Since (19a) implies that 
\ipri) £ supp(p) (cf Appendix 2 (ii) ) , condition (12b) further implies \i[) n ) G supp(cr). Let 
us write down a pure-state decomposition 

^ = E A U<U(<U (20) 

m 

with | (f>i) =| ip n ). (This can be done with > cf [TÜJ . ) Then, applying Q k ...Q k to 
(20) and taking into account (13), we obtain the pure-state decomposition 

QkCrQk = W k (T k = J2^'mQk \ 4>m){4>m \ Qk- 
m 

(Note that w k > because p*,. > by assumption.) Thus, Q k \i^ n ) = Qk \<fii) G supp(afc). 
This is vàlid for any nonzero vector appearing in (19b), and these span supp(p^) (cf 
Appendix 2(ii)). Therefore, (18) is vàlid. 

On account of (12b), the Standard logarithm can be replaced by the extended one 
in definition (12a) of relative entropy: S(p\\cr) = —S(p) — tr [plog e (cr)] . Substituting 
(13) on the RHS, and utilizing (14), the relative entropy S(p\\a) becomes 

-S(p)-tr{p[Y^[log(w k )]Q k +J2[log e ((Tk)]]} = -S(p)-J2[Pklog(w k )}-J2tr[plog e (a k )}. 

k k k k 

Adding and subtracting H(p k ), replacing log e (a k ) by Q k [log e (a k )]Q k , and taking into 
account (16) and (17), one further obtains 

S(p\\a) = -S(p) + H(p k ) + H(p k \\w k ) - J2Pk^[pklog e (a k )]. 

k 

(The zero convention is vàlid for the last term because the density operator Q k pQ k /p k 
may not be defined. Note that replacing J2k by J2'k i n (16) does not change the LHS 
because only p k = terms are omitted.) 

Adding and subtracting the entropies S(p k ) in the sum, one further has 

3(p\\<r) = -S(p) + H(p k ) + H(p k \\w k ) + J2Pk S (Pk) + J2Pk{~ S ÍPk) - ti[p k log e (a k )]}. 

k k 

Utilizing the mixing property of entropy, one can put S(^J2kPkPkj instead of [H(p k ) + 
J2'kPkS(pk)}- Owing to (18), we can replace log e by the Standard logarithm and thus 
obtain the RHS (15). □ 
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Remark. In a sense, (15) runs parallel to Donald's identity 

S (p\W) = J2Pk S (pk\\°~) ~ H(p k ), 
k 

when an orthogonal decomposition p = J2kPkPk of the first state p in relative entropy is 
given. 

For a general decomposition p = J2kPkPk of the first state Donald's identity reads 

S(P\W) = J2Pk S (pk\W) ~ J2Pk S (pk\\p) 

k k 

[T%] . [To] (relation (5) in the latter). The more special relation in the remark follows 
from this on account of the relation that generalizes the mixing property of entropy: If 
P — HkPkPk is any state decomposition, then 

S{p) = Y.Pk S (pk\\p) +Y.Pk S (pk) 

k k 

is vàlid (cf Lemma 4 and Remark 1 in [T6lj ) . 

Now we turn to the derivation of some consequences of proposition 1. 

Let p be a state and A = J2i a %P% + J2j a jPj a spectral form of a discrete observable 
(Hermitian operator) A, where the eigenvalues and üj are all distinct. The index i 
enumerates all the detectable eigenvalues, i. e., Vz : tr(pPj) > 0, and tr[p(J2i Pi)] — 1- 

The simplest quantum measurement of A in p changes this state into the Lüders 
state: 

p L {A) =Y,PiPPi (21) 

i 

(cf (3a) and (3c)). Such a measurement is often called "ideal". 

Corollary 1. The relative- entropic "distance" from any quantum state to its 
Lüders state is the difference between the corresponding quantum entropies: 

s( P \\J2PipPi) = s(Y,Píppï) - s(p). 

i i 

Proof. First we prové that 

supp(p)Çsupp(^P lP P í ). (22) 

i 

To this purpose, we write down a decomposition (19a) of p into pure states. One has 
supp(X^Pi) ^ su PP(p) (equivalent to the certainty of (X^P) in p, cf jlj), and the 
decomposition (19a) implies that each | ip n ) belongs to supp(p) (cf Appendix 2(ii)). 
Hence, \ip n ) G supp(X^Pi); equivalently, \ip n ) = (Y^Pi) l^n)- Therefore, one can write 

Vn: \i, n ) = Y.{Pi\A í ))· (23a) 

i 

On the other hand, (19a) implies 



£ P iP Pi = E E X nPi I Vn) (Vn I Pi- (236) 

i i n 
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As seen from (23b), all vectors (Pj | ip n )) belong to supp(J]j PipPi)- Hence, so do all 
| ip n ) (due to (23a)). Since p is the mixture (19a) of the \ip n ), the latter span supp(p) 
(cf Appendix 2(ii)). Thus, finally, also (22) follows. 

In our case u = J2i PipPi in (15). We replace k by i. Next, we establish 

V* : Q lP Q t = P iP P % . (24) 

Since Qi is, by definition, the support projector of (P^pP^, and Pi(PipPi) = (PipPi), one 
has PíQí = Qi (see Appendix 2(i)). One can write PipPi = Qi(PipPi)Qi, from which 
then (24) follows. 

Realizing that Wi = tv(QipQi) = tr (PipPi) = pi due to (24), one obtains 
H(pí\\wí) = and Vi : S(QipQi/pi\\PipPi/wi) =0 in (15) for the case at issue. 
This completes the proof. □ 

Now we turn to a peculiar further implication of Corollary 1. 

Let B = J2kJ2i k bki k Pki k be a spectral form of a discrete observable (Hermitian 
operator) B such that all eigenvalues b^ k are distinct. Besides, let B be more complete 
than A or, synonymously, a refinement of the latter. This, by definition means that 

V/c : P k = Y J Pu k (25) 

is vàlid. Here k enumerates both the i and the j index vàlues in the spectral form of A. 

Let Pl(A) and Pl(P>) be the Lüders states (21) of p with respect to A and B 
respectively. 

Corollary 2. The states p, pi(A), and Pl{B) lie on a straight líne with respect 
to relatíve entropy, i. e., S(p\\p L (B)^ = S(p\\p L (A)^ + S(p L (A))\\p L (B)^ , or 
explicitly: 

Note that all eigenvalues bki k of B with indices others than Ui are undetectable in 

P- 

Proof. Corollary 1 immediately implies 

S(p\\p L (B)) = [S( PL (B)) - S( PL (A))} + [S( PL (A)) - S(p) , 

and, as easily seen from (21), p L {B) = (p L (A)) (B) due to P ih P v = S^P^ (cf (25)). 

□ 



4. Properties of coherence information 



To begin with, we notice in (10) that Iq depends on p and A, actually only on the 
eigenprojectors of the latter. 
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As a consequence of (10), one can also write the definition of Ic in the form of a 
relative entropy: 

Ic = S(p\\Y,PipPi) (26) 
i 

as follows from corollary 1. 

It was proved long ago [Ej that S(^J2iPipPi) > S(p) if and only if A and p are 
incompatible, and the two entropies are equal otherwise. Thus, in case of compatibility 
[A, p] = 0, Ic is zero, otherwise it is positive. This is what we would intuitively expect. 

It was proved in 0] (theorem 2 there) that 

inc inc inc 

Ic = W mc I C (Y. a l P U (E P 0p(E P 0Mnc), (27) 

l l l 

where "inc" on the sum denotes summing only over all those vàlues of / the corresponding 
Pi of which are incompatible with p, and w inc = tr(^X)| nc Pj). 

This corresponds to an intuitive expectation that the quantity Ic should depend 
only on those eigenprojectors Pi of A that do not commute with p, and not at all on 
those that do. 

We obtain (27) as a special case of a much more general result below (cf the theorem 
and propositions 2 and 3). 

We shall need another known concept. For the sake of precision and clarity, we 
define it. 

Definition 2. One says that a discrete observable A = ^Z m a m P m (spectral form in 
terms of distinct eigenvalues a m ) is coarser than or a coarsening of A = J2i a iPi tf there 
is a partitioning II in the set {l : V7} of all index vàlues of the latter 



m i 

m 



n : {l : vz} = E Q 

such that 

Vm : P m = Pi 

(C m are classes of vàlues of the index l, and the sum is the union of the disjoint classes). 
One also says that A is finer than or a refinement of A. 

Theorem. Let A be any coarsening of A (cf definition 2). Then 

I C (A, P ) = I C (A, P ) + E [p m Ic(PmA, PmpPjPm)] , (28) 

m 

and Vm : p m = tr(pP m ). (If p m = 0, then, by the zero convention, the corresponding 
Ic in (28) need not be defined. The product is by definition zero.) 

Before we prové the theorem, we apply corollary 2 to our case. 

Under the assumptions of the theorem, one has 

S( P \ | BVl)) =S( P \\J2(PmpPm)) +S(j2(Pm P Pm) \\ (29) 
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Proof of the Theorem. On account of (26), (29) takes the form 

Ic(A,p)=I c (A,p) + I c (A,Y / (P mP P m )). (30) 

m 

Utilizing (10) for the second term on the RHS, the latter becomes S(j2i(PipPi)) ~ 
S(j2m{PmpPm)^) ■ Making use of the mixing property of entropy in both these terms, 
and cancelling out H{p m ) (cf (7b) mutatis mutandis), this difference, further, becomes 
E m PmS[(EieCm PiP p i)/Pm) - Em PmS (P m pPm/ Pm)) ■ Its substitution in (30) with the 
help of (10) (and defmition 2) then gives the claimed relation (28). (Naturally, one must 
be aware of the fact that À is a coarsening of A, hence Vm : [P m , A] = 0, implying 

A = X^m Sm' Pm.AP m ' X/m Pm.A.) d 

If À is any coarsening of A, then the index vàlues m of the former replace classes 
C m of index vàlues l of the latter. Hence, coherence in A - as a cooperative role of 
index vàlues - must be poorer than in A. Therefore, one would intuitively expect that 
Ic(A, p) must not be larger than Ic(A, p). The theorem confirms this, and telis more: it 
gives the expression by which Ic(A, p) exceed Ic(A, p). One wonders what the intuitive 
meaning of this is. 

Discussion of the theorem. Let us think of p as describing a laboratory 
ensemble, and let us imagine that an ideal measurement of A is performed on 
each quantum system in the ensemble. The ensemble p is then replaced by the 
mixture J2mPm(PmpPm/Pm) of subensembles (P m pP m /Pm)- One can think of 
the measurement of the more refined observable A as taking place in two steps: the 
first is the mentioned measurement of the coarser observable A, and the second is a 
continuation of measurement of A in each subensemble (P m pP m /Pm)- Let us assume 
additivity of Ic in two-step measurement. 

Further, let us bear in mind that, though Iq is meant to be a property of 
each individual member of the ensemble p, it is statistical, i. e., it is given 
in terms of the ensemble. Finally, in the second step we have an ensemble of 
subensembles (a superensemble). Since our system is anywhere in the entire ensemble 
J2mPm(P m pPm/Pm) of the second step, one must average over the superensemble with 
the statistical weights p m of its subensemble-members (P m pP m /Pm)- 

If m! 7^ m, then the part P m >A of A — J2 m " P m "A is evidently undetectable 
in the subensemble p m . Hence, only P m A is relevant from the entire A, i. e., Ic(A, p) 
reduces to I c (P m A, p rn ) there. 

In this way one can understand relation (28). What have we learnt from this? It 
is that Ic is additive and statistical. This conclusion is in keeping with the neighboring 
quantity S(A,p). Namely, one can easily derive a relation similar to (28) for it: 

S(A,p) = S(A,p) + J2p m S(P m A,P m pP m / Pm ). 

m 

That Ic and S(A, p) behave equally in an additive and statistical way is no surprise 
since they are terms in the same general decomposition (11) of the entropy S(p) of the 
state p. 
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The theorem is a substantially stronger form of a previous result (theorem 3 in jlj), 
in which I c (A,p) > I c (A,p) was established with necessary and suficient conditions 
for equality, which are obvious in the theorem. (Ic was denoted by Eq in previous work, 
cf my comment following proposition 5 below.) 

The theorem has the following immediate consequences. 

Proposition 2. // the coarsening À defined in definition 2 is compatible with p, 
then (28) reduces to 



Proposition 3. Let us define a coarsening II (cf definition 2) that partitions 
{l : V/} into at most three classes: C inc comprising all index vàlues l for which ai is 



of all l for which a t is detectable and P\ is compatible with p, and, finally, C un d which is 
made up of all l for which ai is undetectable. The coarsening thus defined is compatible 
with p, and (31) reduces to (27). 

Proof. In the coarsening II of proposition 3 the index m takes on three 'vàlues': 
'inc', 'comp', and 'und'. It is easily seen that the coarser observable A thus defined is 
compatible with p. Hence, (31) applies. Further, the second and third terms are zero. 
In this way, (27) ensues. □ 

Proposition 4. Coherence information Iq is unitary invariant, i. e., Ic{A,p) = 
IciUAU^UpU^), where U is an arbitrary unitary operator. 

Proof. Relative entropy is known to be unitary invariant. On account of (26), so 
is I c . □ 

This is as it should be because Ic should not depend on the basis in the state space: 
\JAU~ 1 and U pü~ x can be understood as A and p respectively viewed in another basis. 

Proposition 5. Coherence information Ic is convex. 

Proof. This is an immediate consequence of the known convexity of relative entropy 
(cf (26)) under joint mixing of the two states in it. 

On account of convexity we know that Ic is an information entity, and not an 
entropy one (or else it would be concave). In previous work pQ, [3], [2] the same 
quantity (the RHS of (10)) was erroneously denoted by Ec(A,p) and treated as an 
entropy quantity. But this does not imply that any of the applications of E c (A,p) was 
erroneous. All one has to do is to replace this symbol by Ic{A, p) and keep in mind that 
one is dealing with an information quantity. 

5. Conclusion 

Perhaps it is of interest to comment upon the more Standard uses of the term " coherence" 
in the literature. 

One encounters the bàsic use of the word "coherence" in the properties of light 
waves. One distinguishes two types of coherence there: (i) Temporal coherence, which 




(31) 



detectable (i. e., of positiu e probability) and Pi is incompatible with p, C comp consisting 
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is a measure of the correlation between the phases of a light wave at different points 
along the direction of propagation, and (ii) spatial coherence, which is a measure of 
the correlation between the phases of a light wave at different points transverse to the 
direction of propagation. (The fascinating phenomenon of holography requires a large 
measure of both temporal and spatial coherence of light.) 

Quantum "coherence" refers also to large numbers of partides that cooperate 
collectively in a single quantum state. The best known examples are superfluidity, 
superconductivity, and làser light, all macroscopic phenomena. In the last example 
different parts of the làser beam are related to each other in phase, which can lead to 
interference effects. "Coherence" is often related to different kinds of correlations, see, 

e. g., ng. 

In all mentioned examples "coherence" refers to an absolute property of the quantum 
state of the system; in contrast with the use of the term in this article, which expresses 
a relative property: relation between observable and state. As it was mentioned, the 
kind of quantum coherence studied in this article can be more fully called "eigenvalue 
coherence of an observable in relation to a state" in view of the cooperative role of the 
eigenvalues (or rather their quantum numbers, because the vàlues of the eigenvalues 
play no role) as seen in (4). 

In the literature one often finds the claim that quantum pure states are coherent. 
From the analytical point of view of this article one can say that a pure state | ip) is not 
coherent with respect to any observable for which | ip) (0 1 is an eigenprojector. But it is 
coherent with respect to all other observables. 

5.1. On generality of the results 

A question may linger on to the end of this study: What if the observable is not a 
discrete one? Can one still speak of eigenvalue coherence in relation to a given state p? 

It seems to me that the answer is that one should write down the following partial 
spectral form of a general observable A'\ 

A' = Y,a l Pi + P ± A'P ± , 
i 

where the summation goes over all eigenvalues of A', and P = Y^iPi- One should take 
the discrete coarsening A of A': 

A = Y,^iPi + aP ± , 
i 

where the eigenvalue a is arbitrary but distinct from all {ai : V/}. Then the expounded 
eigenvalue coherence theory should by applied to A, and it should be vàlid for A' (as 
the best we can do for the latter). In a preceding article jl] the case when P 1 - ^ with 
the eigenvalue a undetectable was studied. 

One has eigenvalue coherence of a general observable A' in relation to a state p if 
either A' has at least two eigenvalues or if A' has at least one eigenvalue and P ^ 0. 
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Another question that may linger on is whether the state p that was used in this 
paper is really general. If p has an infinite-dimensional range and A has infinitely- 
many eigenvalues, it may happen that there are infinitely-many detectable ones. The 
expounded theory covers also this case. 

5.2. Summing up 

In an attempt to understand the essential features of two-slit interference (see lemma 1 
followed by its application to two-slit interference in subsection 1.2), a general coherence 
theory was developed based on the assumption that 'coherence' equals 'incompatibility' 
[A, p] between observable and state. Since this relation means that p is incompatible 
with at least one eigenevent (eigenprojector) Pi of A, and this property is independent 
of the eigenvalues, it was argued that the entire family of observables with one and the 
same decomposition of the identity YjiPi — I (the latter is called " closure relation" 
if A is complete) should have the same amount of incompatibility. This discarded 
the Wigner-Yanase-Dyson family of skew informations (6). Further, it was argued 
that the necessarily nonnegative quantity S(A C , p) — S(p) was a natural measure of 
incompatibility between a complete observable A c and the state p satisfying the stated 
claim. Finally, interpolating between the case of a complete and that of a compatible 
observable (see (8), (9) and (10)), the general expression (10) was obtained. 

Thus, a natural quantum measure of how much of coherence, and, equivalently, 
incompatibility, there is if a discrete observable A = J2i a iPi an d a state p are given 
was derived along the expounded argument. It was called coherence or incompatibility 
information (denoted by Ic(A, p) or shortly Ic) in section 2. 

A deviation into a general relative-entropy investigation was made in section 3. 
What was called 'the mixing property of relative entropy' (parallelling that of entropy) 
was derived, and so were two corollaries. 

The relative-entropy results were utilized to express coherence information Ic(A, p) 
in the form of a relative entropy (cf (26)) in section 4. Connection between the coherence 
information Ic(A, p) of any coarsening A (cf definition 2) of an observable A and Ic{A, p) 
was obtained in the theorem. Its intuitive meaning was discussed. It was concluded that 
Ic is additive in two-step measurement and statistical. 

The corresponding relation took a much simpler form in case A was compatible 
with p (cf proposition 2). In a special case of this a result from previous work was 
recognized (cf proposition 3 and (27)). Coherence information was shown to be unitary 
invariant (proposition 4) and convex (proposition 5). 

In previous work pQ, jlj, [2] the coherence information Ic was successfully utilized in 
analyzing bipartite quantum correlations. The last one of them filled in an information- 
theoretical gap noted in preceding investigation of the measurement process |3j. 

Since a number of new properties of Ic have now been obtained, even more fruitful 
applications can be expected. 
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We prové the equivalence of the negations of the four claims in lemma 1. ("-> (i)" 
is the negation of (i) etc, and is the claim of The logical scheme of the 

proof is: -i (ii) <=ï -i (iii) <=>■ -i (iv); -i (ii) =>- -i (i) =>• -i (iii). 

-i (ii) -i (iii): One can always write p = J2i J2i> PipPv • Since A and p commute 
if and only if each eigenprojector Pi of A commutes with p, the claimed equivalence is 
obvious. 

(-i (iii) -i (iv)) is obvious. To prové ^-i (iv) =>> -i (iii)), we restrict the operators 
B to ray projectors |o)(a|. Then -i (iv) implies tr(p \a)(a\) = (a \ p \a) = (a \ Pl \a) for 
every state vector \a). But then, as well known, one must have p = pi, which is -> (iii). 

-i (ii) -i (i): In view of p = J2iY^i' PipPi'i commutation of p with each P\ 

implies -i (i). 

-i (i) (=>) -i (iii): Let us assume that p = J2iPiPh an d that each state pi has the 
sharp value of the corresponding eigenvalue a\ of A. Then pi = P\piP\ (cf lemma A.4. 
in [Hj). Substituting this in the state decomposition, and subsequently evaluating p L 
according to (3a)- (3c), one can see that -i (iii) follows. □ 

Appendix 2. 

Let p = J2 n \ n )( n \ be an arbitrary decomposition of a density operator into ray 
projectors, and let E be any projector. Then 

Ep = p Vn : E\n)=\n) (Al) 

(cf Lemma A.l. and A.2. in [20]). 

(i) If the above decomposition is an eigendecomposition with positive weights, then 
J2 n \ n )( n \= Qi Q being now the support projector of p, and, on account of (A.l), 

Ep = p => EQ = Q. (A.2) 

(ii) Since one can always write Qp = p, (A.l) implies that all | n) in the arbitrary 
decomposition belong to supp(p). Further, defming a projector F so that supp(F) = 
span({| n) : Vn}), one has FQ = F. Equivalence (A.l) implies Fp = p. Hence, (A.2) 
gives QF = Q. Altogether, F — Q, i. e., the unit vectors {|n) : Wn} span supp(p). 
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